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REMARKS 

Applicants have cancelled Claim 27 without prejudice to, or disclaimer of, the subject 
matter contained therein. Applicants maintain that the cancellation of a claim makes no 
admission as to its patentability and reserve the right to pursue the subject matter of the cancelled 
claim in this or any other patent application. 

Applicants have amended Claim 22 to read "specifically binds." Applicants maintain that 
the amendments add no new matter and are fully supported by the specification as originally 
filed. For example, support for the amendment to Claim 22 can be found in the substitute 
specification in original Claim 18. 

Claims 22-26 are presented for examination. Applicants respond below to the specific 
rejections raised by the Examiner in the Office Action mailed September 8, 2004. For the 
reasons set forth below, Applicants respectfully traverse. 

Rejection under 35 U.S.C. S101 - Utility 

The PTO has rejected the pending claims under 35 U.S.C. § 101 as lacking patentable 
utility. The PTO concedes that the cited utilities are credible. However, the PTO alleges that the 
invention lacks both substantial and specific utility. Applicants respectfully disagree. 

Substantial Utility 

The PTO argues that the invention lacks substantial utility because the level of 
overexpression in cancer cells of the nucleic acid which encodes the PR0539 protein was 
minimal, and there is no evidence that overexpression is significant or a real effect and not 
simply produced by chance. In addition, the PTO argues that the invention lacks utility because 
the overexpression of the nucleic acid is not relevant to the utility of the protein and antibodies 
and there is no evidence that the protein is overexpressed. The PTO cites three references to 
support its position that there is no necessary correlation between gene amplification, gene 
expression, and protein expression. The PTO concludes that because there is no necessary 
connection between the amount of DNA in a cell and the amount of mRNA, and no necessary 
connection between the level of protein in a cell and the amount of mRNA, any evidence of 
overexpression of one component does not provide utility for the protein. The PTO argues that 
the current situation closely tracks Example 12 of the Utility Guidelines, because where there is 
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no necessary relationship between the protein levels or utilities and a small level of mRNA 
overexpression in cancer cells, the invention lacks any "real world" context of use for PR0539. 

Utility need NOT be Proved to an Absolute Certainty - a Correlation between the Evidence and 

the Asserted Utility is Sufficient 

Compliance with 35 U.S.C. § 101 is a question of fact. Raytheon v. Roper, 724 F.2d 951, 

956, 220 USPQ 592, 596 (Fed. Cir. 1983) cert, denied, 469 US 835 (1984). The evidentiary 

standard to be used throughout ex parte examination in setting forth a rejection is a 

preponderance of the evidence, or "more likely than not" standard. In re Oetiker, 977 F.2d 1443, 

1445, 24 USPQ2d 1443, 1444 (Fed. Cir. 1992). This is stated explicitly in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not . M.P.E.P. at § 2107.02, part VII (2004) (emphasis in 
original, internal citations omitted). 

The PTO has the initial burden to offer evidence "that one of ordinary skill in the art 
would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 
1436 (Fed. Cir. 1995). Only then does the burden shift to the Applicant to provide rebuttal 
evidence. Id. As stated in the M.P.E.P., such rebuttal evidence does not need to absolutely prove 
that the asserted utility is real. Rather, the evidence only needs to be reasonably indicative of the 
asserted utility. 

In Fujikawa v. Wattanasin, 93 F.3d 1559, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996), the Court 

of Appeals for the Federal Circuit upheld a PTO decision that in vitro testing of a novel 

pharmaceutical compound was sufficient to establish practical utility, stating the following rule: 

[T]esting is often required to establish practical utility. But the test results need 
not absolutely prove that the compound is pharmacologically active. All that is 
required is that the tests be "reasonably indicative of the desired 
[pharmacological] response." In other words, there must be a sufficient 
correlation between the tests and an asserted pharmacological activity so as to 
convince those skilled in the art, to a reasonable probability, that the novel 
compound will exhibit the asserted pharmacological behavior." Fujikawa v. 
Wattanasin, 93 F.3d 1559, 1564, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996) (internal 
citations omitted, bold emphasis added, italics in original). 
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While the Fujikawa case was in the context of utility for pharmaceutical compounds, the 
principals stated by the Court are applicable in the instant case where the asserted utility is for a 
therapeutic and diagnostic use - utility does not have to be established to an absolute certainty, 
rather, the evidence must convince a person of skill in the art "to a reasonable probability." In 
addition, the evidence need not be direct, so long as there is a "sufficient correlation" between 
the tests performed and the asserted utility. 

The Court in Fujikawa relied in part on its decision in Cross v. Ilzuka, 753 F.2d 1040, 

224 U.S.P.Q. 739 (Fed. Cir. 1985). In Cross, the Appellant argued that basic in vitro tests 

conducted in cellular fractions did not establish a practical utility for the claimed compounds. 

Appellant argued that more sophisticated in vitro tests using intact cells, or in vivo tests, were 

necessary to establish a practical utility. The Court in Cross rejected this argument, instead 

favoring the argument of the Appellee: 

[I\n vitro results... are generally predictive of in vivo test results, i.e., there is a 
reasonable correlation therebetween. Were this not so, the testing procedures of 
the pharmaceutical industry would not be as they are. [Appellee] has not urged, 
and rightly so, that there is an invariable exact correlation between in vitro test 
results and in vivo test results. Rather, [Appellee's] position is that successful in 
vitro testing for a particular pharmacological activity establishes a significant 
probability that in vivo testing for this particular pharmacological activity will be 
successful. Cross v. Ilzuka, 753 F.2d 1040, 1050, 224 U.S.P.Q. 739 (Fed. Cir. 
1985) (emphasis added). 

The Cross case is very similar to the present case. Like in vitro testing in the 
pharmaceutical industry, those of skill in the field of biotechnology rely on the reasonable 
correlation that exists between gene expression and protein expression (see below). Were there 
no reasonable correlation between the two, the techniques that measure gene levels such as 
microarray analysis, differential display, and quantitative PCR would not be so widely used by 
those in the art. As in Cross, Applicants here do not argue that there is "an invariable exact 
correlation" between gene expression and protein expression. Instead, Applicants' position 
detailed below is that a measured increase in gene expression or gene amplification in cancer 
cells establishes a "significant probability" that the encoded polypeptide will also be 
overexpressed in cancer based on "a reasonable correlation therebetween". 
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Taken together, the legal standard for demonstrating utility is a relatively low hurdle. An 
Applicant need only provide evidence such that it is more likely than not that a person of skill 
in the art would be convinced, to a reasonable probability, that the asserted utility is true. 

The evidence need not be direct evidence, so long as there is a reasonable correlation between the 
evidence and the asserted utility. The standard is not absolute certainty. 

Even assuming that the PTO has met its initial burden to offer evidence that one of 
ordinary skill in the art would reasonably doubt the truth of the asserted utility, Applicants assert 
that they have met their burden of providing rebuttal evidence such that it is more likely than not 
those skilled in the art, to a reasonable probability, would believe that antibodies to the PR0539 
polypeptide are useful as a diagnostic tool for cancer. 

Applicants have established that the Gene Encoding the PRQ539 Polypeptide is Amplified in 
Lung and Colon Tumors compared to Normal Tissue and is Useful as a Diagnostic Tool 

Applicants first address the PTO's argument that the level of overexpression of nucleic 
acid encoding PR0539 was minimal and insignificant. Applicants submit that the gene 
amplification data provided in the present application are sufficient to establish a specific and 
substantial utility for the gene encoding the PR0539 polypeptide, as well as antibodies to the 
PR0539 polypeptide. 

Applicants previously submitted the declaration of Dr. Audrey Goddard with exhibits 
A-G. In her declaration, Dr. Goddard states that a 2-fold increase in gene copy number, i.e., a 
ACt value of 1, is "significant and useful" in detecting cancerous tumors or the diagnosis of 
cancer. Goddard Declaration, paragraph 7. The nucleic acid encoding the PR0539 polypeptide 
has a value of 1 or greater in several tumor samples tested. Thus, the differential expression of 
the nucleic acid encoding PR0539 can be used to distinguish cancerous tissue from normal 
tissue. 

In the present Office Action, the PTO has not offered any reason to reject Dr. Goddard's 
declaration. Applicants remind the PTO that the applicant need not provide evidence such that it 
establishes an asserted utility "as a matter of statistical certainty." M.P.E.P. at § 2107.02, part 
VII (2004). Applicants therefore submit that they have established that the gene amplification 
data reported in Example 16 are significant, and the utility for the PR0539 DNA in 
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distinguishing between normal and cancerous tissue has been established. For the reasons 
discussed below, this leads to utility for antibodies to the PR0539 polypeptide as well. 

Applicants have established that the Accepted Understanding in the Art is that there is a 
Reasonable Correlation between Gene Amplification and Overexpression of the Encoded Protein 

Applicants next address the PTO's argument that the invention lacks utility because the 
overexpression of the nucleic acid is not relevant to the utility of the protein and antibodies, and 
there is no evidence that the protein is overexpressed. The PTO cites Pennica et al (Proc. Natl. 
Acad. Sci. (1998) 95:14717-14722) for the proposition that there is no necessary connection 
between the amount of DNA in a cell and the amount of mRNA in a cell. The PTO also cites 
Meric et al (Molecular Cancer Therapeutics (2002) 1:971-79) and Gokman-Polar (Cancer 
Research (2001) 61:1375-81) to support its position that there is no necessary correlation 
between mRNA levels and protein levels. The PTO concludes that because there is no necessary 
connection between gene amplification and mRNA, and between mRNA and protein, any 
evidence of overexpression of one component does not provide utility for the protein. 

As discussed above, evidence of utility does not have to be to an absolute certainty, and 
therefore there does not need to be a necessary connection between gene amplification and 
protein expression. Rather, there need only be a reasonable correlation between the evidence 
offered and the asserted utility such that it is more likely than not that a person of skill in the art 
would be convinced, to a reasonable probability, that the asserted utility is true. 

Applicants submit that those of skill in the art would recognize that there is a reasonable 
correlation between amplification of a gene and an increase in gene expression. This assertion is 
supported by numerous references. Orntoft et al. {Molecular and Cellular Proteomics, 1:37-45 
(2002); submitted herewith as Exhibit 1) studied transcript levels of 5600 genes in malignant 
bladder cancers which were linked to a gain/loss of chromosomal material using an array-based 
method. Orntoft et al showed that there was a gene dosage effect and teach that "in general (18 
of 23 cases) chromosomal areas with more than 2-fold gain of DNA showed a corresponding 
increase in mRNA transcripts." Orntoft at 37, column 1, abstract. In addition, Hyman et al 
{Cancer Research, 62:6240-6245 (2002); submitted herewith as Exhibit 2) used CGH analysis 
and cDNA microarrays to compare DNA copy numbers and mRNA expression of over 12,000 
genes in breast cancer tumors and cell lines. They showed that there is "evidence of a prominent 
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global influence of copy number changes on gene expression levels." Hyman at 6244, column 1, 
last paragraph. 

Additional supportive teachings are also provided by Pollack et ah (PNAS, 99:12963- 
12968 (2002); submitted herewith as Exhibit 3) who studied a series of primary human breast 
tumors and found that "[b]y analyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct effect on global gene expression 
patterns in both breast cancer cell lines and tumors." Pollack at 12967 at column 1, emphasis 
added. Their study found that "62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number influences gene expression across a wide range of 
DNA copy number alterations (deletion, low-, mid- and high-level amplification), that on 
average, a 2-fold change in DNA copy number is associated with a corresponding 1.5-fold 
change in mRNA levels." (Pollack at 12963, column 1, abstract). 

Together, these articles collectively teach that it is more likely than not that gene 
amplification increases mRNA expression. This evidence establishes that there is a reasonable 
correlation between gene amplification and gene expression, and one of skill in the art would 
believe, to a reasonable probability, that gene amplification would lead to increased gene 
expression. 

Relying on a single contrary example of one gene, the PTO states that the literature 
reports that it does not necessarily follow that an increase in gene copy number results in 
increased gene expression and increased polypeptide expression. The PTO focuses on a 
statement from the abstract of Pennica that the WISP-2 gene DNA was amplified in colon 
tumors, but RNA expression was reduced. Pennica at 14717. This inverse correlation is in 
contrast to the WISP-1 gene, which was amplified and had higher RNA levels. The authors of 
Pennica offer an explanation for what they obviously viewed as an anomalous result: "Because 
the center of the 20ql3 amplicon [of which WISP-2 is a part] has not yet been identified, it is 
possible that the apparent amplification observed for WISP-2 may be caused by another gene in 
this amplicon." Id. at 14722, emphasis added. Thus, the example of a lack of positive 
correlation between gene amplification and RNA levels relied on by the PTO may not even be 
real. The fact that the authors attempt to explain this anomaly only supports Applicants' 
argument that the accepted understanding in the art is that there is a direct correlation between 
gene amplification and an increase in gene expression. 
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As stated above, the standard for utility is not absolute certainty, but rather whether one 
of skill in the art would be more likely than not to believe the asserted utility. Even if Pennica 
supported the PTO's argument, which it does not, one contrary example is not sufficient to prove 
that a person of skill in the art would have a reasonable doubt that gene amplification is not 
correlated to gene expression. Given the evidence provided by the Applicants which establishes 
that there is a reasonable correlation between gene amplification and mRNA expression, one of 
skill in the art would believe, to a reasonable probability, that the reported amplification of the 
PR0539 gene would lead to an increase in the level of PR0539 mRNA. 

Applicants next address the PTO's argument that there is no necessary correlation 
between mRNA levels and protein levels. 

Applicants have previously submitted a copy of a Declaration by J. Christopher Grimaldi, 
an expert in the field of cancer biology. As stated in paragraph 5 of the declaration, "Those who 
work in this field are well aware that in the vast majority of cases, when a gene is over- 
expressed... the gene product or polypeptide will also be overexpressed." Similarly, the 
previously submitted declaration of Paul Polakis, Ph.D., an expert in the field of cancer biology 
states that "it remains a central dogma in molecular biology that increased mRNA levels are 
predictive of corresponding increased levels of the encoded protein." Polakis Declaration, 
paragraph 6. He cites as supporting evidence not only his years of personal experience, but also 
results from experiments related to the present application. He reports that for the mRNAs 
overexpressed in cancer that have been examined, 80% had correspondingly higher levels of the 
encoded protein. Polakis Declaration at paragraphs 4 and 5. 

The statements of Grimaldi and Polakis are supported by the teachings in Molecular 
Biology of the Cell, a leading textbook in the field (Bruce Alberts, et al, Molecular Biology of 
the Cell (4 th ed. 2002) submitted herewith as Exhibit 4). Figure 6-3 on page 302 illustrates the 
basic principle that there is a correlation between increased gene expression and increased 
protein expression. The accompanying text states that "a cell can change (or regulate) the 
expression of each of its genes according to the needs of the moment - most obviously by 
controlling the production of its mRNA'" Molecular Biology of the Cell at 302, emphasis added. 
Similarly, figure 6-90 on page 364 illustrates the path from gene to protein. The accompanying 
text states that while potentially each step can be regulated by the cell, "the initiation of 
transcription is the most common point for a cell to regulate the expression of each of its genes." 
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Molecular Biology of the Cell at 364. This point is repeated on page 379, where the authors state 
that of all the possible points for regulating protein expression, "[f]or most genes transcriptional 
controls are paramount." Molecular Biology of the Cell at 379. 

Together, the declarations of Grimaldi and Polakis, the accompanying references, and the 
excerpts from the Molecular Biology of the Cell all establish that the accepted understanding in 
the art is that there is a reasonable correlation between gene expression and the level of the 
encoded protein. Applicants have demonstrated the increased expression of the gene encoding 
PR0539, and have provided sufficient evidence to show that there is a reasonable correlation 
between expression of the gene and the level of PR0539 protein. 

In arguing against this assertion, the PTO cites two references. The PTO relies on a 
statement from Gokman-Polar that "PKC mRNA levels do not directly correlate with PKC 
protein levels." Office Action at 3-4. However, a close review of the entire article indicates that 
with one exception, the trend in the data is that mRNA and protein levels are positively 
correlated, supporting Applicants assertion. In Figure 2, the protein level of two isozymes shows 
a decrease, while the third is increased. This same pattern is seen for the corresponding mRNA 
levels in Figure 6, although admittedly the increase in mRNA for the third isozyme is minimal. 
Similarly, comparing the protein levels of the three isozymes in Figure 4 to the corresponding 
mRNA levels in Figure 7, with one exception the mRNA levels are positively correlated to 
protein levels. While protein levels do not increase or decrease in direct proportion to the 
changes in mRNA, the trend in five of the six examples is that protein levels are positively 
correlated to mRNA levels. This evidence is hardly sufficient to establish that one of skill in the 
art would reasonably doubt that there is a reasonable correlation between mRNA levels and 
protein levels. 

The Meric article cited by the PTO offers even less support for the PTO's position. The 

PTO relies on the statement that "Gene expression is quite complicated, however, and is also 

regulated at the level of mRNA stability, mRNA translation, and protein stability." Office 

Action at 3. What the PTO ignores is the preceding statement by the authors: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
efforts have concentrated on identifying differences in gene expression at the level 
of mRNA, which can be attributable to either DNA amplification or to differences 
in transcription. Meric at 971 (emphasis added). 
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This statement supports Applicants' asserted utility. It is true that there is no necessary 
correlation between gene expression and protein expression because there are other mechanism 
for regulating gene expression. However, were there no significant correlation between gene 
expression and protein levels, exploiting differences in gene expression between cancer cells and 
normal cells would not be a "fundamental principle of molecular therapeutics in cancer." 

In light of the lack of significant support for the PTO's argument, Applicants submit that 
the PTO has failed to establish a reason for one of skill in the art to doubt the asserted utility. 
Even if it has, Applicants have offered sufficient evidence to rebut the PTO's argument and 
establish that there is a reasonable correlation between gene amplification, gene expression, and 
protein expression. The PTO is again reminded that absolute predictability is not required. 
Applicants have established that it is more likely than not that one of skill in the art would be 
convinced, to a reasonable probability, that the PR0539 protein is overexpressed in certain 
cancers, and therefore antibodies to the PR0539 protein have utility as a diagnostic tool. 

The Instant Case Differs Significantly from Example 12 of the Utility Guidelines 

Applicants next address the PTO's argument that the current situation closely tracks 
Example 12 of the Utility Guidelines, because where there is no necessary relationship between 
the protein levels or utilities and a small level of mRNA overexpression in cancer cells, the 
invention lacks any "real world" context of use for PR0539. 

In Example 12, the specification discloses a protein, receptor A, which is the binding 
partner for protein X. The specification does not characterize the isolated protein with regard to 
its biological function or any disease or body condition that is associated with the isolated 
protein. In addition, the function of protein X has also not been identified. One of the asserted 
utilities for receptor A is making monoclonal antibodies to receptor A which can be used as a 
therapeutic drug to effect control over the receptor. In the analysis of this asserted utility for 
receptor A, the Utility Guidelines state that "since neither the specification nor the art of record 
disclose any diseases or conditions associated with receptor A, the asserted utility in this case 
essentially is a method of treating an unspecified, undisclosed disease or condition, which does 
not define a 'real world' context of use." Utility Guidelines at 66, emphasis added. 

The situation in Example 12 is not the situation here. Applicants have demonstrated that 
the nucleic acid encoding PR0539 is amplified in certain cancers. Thus, unlike the protein in 
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Example 12, PR0539 is associated with a disease or condition - more specifically, lung and 
colon cancer. 

The PTO asserts that because it is the nucleic acid, and not the PR0539 polypeptide 
which has been shown to be amplified in cancer cells, the PR0539 polypeptide is not associated 
with any disease. However, as discussed at length above, Applicants have demonstrated a 
reasonable correlation between gene amplification and protein expression such that one of skill 
in the art would believe, to a reasonable probability, that the PR0539 protein is overexpressed in 
certain cancers and therefore antibodies to PR0539 are useful as a diagnostic tool. 

The present situation closely resembles the caveat discussed at the end of Example 12, 
where receptor A is shown to be present on the cell membranes of melanoma cells but not on the 
cell membranes of normal skin cells. The Utility Guidelines state that in that situation, "making 
a monoclonal antibody to receptor A for diagnosing melanoma would constitute a well- 
established utility." Utility Guidelines at 70. Similarly, here Applicants have provided evidence 
that it is more likely than not that PR0539 is expressed at higher levels in certain cancer cells 
than normal tissue, and therefore it can be used to make antibodies which have utility as a 
diagnostic tool. 

The PTO 's Response to Applicants ' Expert Declarations and Arguments is Insufficient to reject 
the Applicants ' Asserted Utility 

The PTO's rejection of Applicants' expert declarations as "fundamentally flawed" 
because they fail to provide specific evidence regarding PR0539 is unwarranted. As discussed 
above, specific evidence of overexpression of PR0539 in cancer is not required. Instead, indirect 
evidence of the asserted utility of antibodies to the PR0539 polypeptide can be offered so long as 
there is a "reasonable correlation" between the proffered evidence and asserted utility, such that it 
is more likely than not that a person of skill in the art would believe the asserted utility. The 
Applicants' expert declarations establish a reasonable correlation between gene amplification and 1 
protein expression, and in view of Applicants' data showing amplification of the gene encoding 
PR0539 in lung and colon cancer, thus support the asserted utility of antibodies to PR0539 as a 
diagnostic tool for cancer. 

Similarly, the PTO's reliance on In re Kirk, 376 F.2d 936, 153 U.S.P.Q. 48 (C.C.P.A. 
1967) is also misplaced. In Kirk, the asserted utility for the claimed compounds was "a new class 
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of compounds often possessing high biological activity" and "intermediates in the preparation of 
compounds with valuable biological properties..." Id. at 1120, 1121. The Court rejected these 
statements as "nebulous expressions" of usefulness. Id, at 1 124. 

Here, Applicants have asserted a much more specific utility than "biological activity" or 
"biological properties". Applicants have provided evidence of amplification of the gene 
encoding PR0539 in certain cancers and have shown that this evidence is reasonably correlated 
to overexpression of the PR0539 polypeptide in those same cancers, namely, lung and colon 
cancer. As Example 12 of the Utility Guidelines make clear, when a protein is differentially 
expressed in cancer compared to normal tissue, the protein and antibodies have utility in 
diagnosing the cancer. This is the situation here. 

Specific Utility 

The PTO argues that even if substantial utility were found, there is no specific utility 
given for antibodies to the PR0539 protein, since antibodies to the protein, as distinguished from 
the nucleic acid, have not been associated with any disease, condition, or any other specific 
feature. Relying on the lack of correlation between levels of nucleic acid and protein cited in the 
Gokman-Polar and Meric references, the PTO argues that the overexpression of the nucleic acid 
gives no specific utility because it is entirely unrelated to uses of the protein or antibody. 
Applicants respectfully disagree. 

Specific Utility is defined as utility which is "specific to the subject matter claimed," in 
contrast to "a general utility that would be applicable to the broad class of the invention." 
M.P.E.P. § 2107.01, part I (2004). Applicants submit that the evidence of amplification of 
PR0539 nucleic acids in certain types of cancer cells along with the declarations and references 
discussed above provide a specific utility for the claimed antibodies. As stated above, Applicants 
have established a reasonable correlation between gene amplification and protein expression. 
This makes antibodies to the PR0539 protein useful in diagnosing lung and colon cancer. This 
is not a general utility that would apply to the broad class of antibodies. 

The amplification of PR0539 nucleic acid in certain cancer cells distinguishes this case 
from Examples 4 and 12 of the Utility Guidelines cited by the PTO. In both examples, there is 
no description of the protein beyond its sequence or its binding of an unidentified ligand. Here, 
the disclosed proteins are encoded by a nucleic acid that is amplified in certain cancer cells, 
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which is reasonably correlated to overexpression of the PR0539 polypeptide. This makes the 
utility of using antibodies to the protein to diagnose lung and colon cancer specific, since in 
general, antibodies are not specific to proteins that are overexpressed in cancer cells. 

The PTO's previous response to Applicants' arguments regarding specific utility is 
lacking. The PTO asserts that because Applicants' arguments presume the PR0539 protein is 
overexpressed, and this is not necessarily the case, this cannot serve as the foundation to support 
specific utility. However, utility need not be established "beyond a reasonable doubt" or to a 
"statistical certainty." Rather, Applicants need only establish that the asserted utility is "more 
likely than not." M.P.E.P. at § 2107.02, part VII (2004). Thus, it need not be shown that 
overexpression of PR0539 polypeptide is necessarily the case, only that it is more likely than 
not, which Applicants have done. 

Conclusion 

Given the totality of the evidence provided, Applicants submit that they have established 
a credible, substantial, and specific utility for the claimed antibodies as diagnostic tools. 
According to the M.P.E.P. and case law cited above, irrefutable proof of a claimed utility is not 
required. Rather, a specific and substantial credible utility requires only a "reasonable" 
confirmation of a real world context of use. Applicants have offered sufficient evidence to 
establish that there is a reasonable correlation between gene amplification, gene expression, and 
protein expression. Applicants have established that it is more likely than not that one of skill in 
the art would be convinced, to a reasonable probability, that based on the gene amplification data 
for the gene encoding PR0539, the PR0539 protein is overexpressed in certain cancers, and 
therefore antibodies to PR0539 have utility as a diagnostic tool. In view of the above, 
Applicants respectfully request that the PTO reconsider and withdraw the utility rejection under 
35 U.S.C. §101. 

Rejection under 35 U.S.C. SI 12 - Enablement 

The PTO rejected Claims 22-27 under 35 U.S.C. § 112, first paragraph, as containing 
subject matter which was not described in the specification in such a way as to enable one skilled 
in the art to make and/or use the invention. The PTO cites In re Wands and the factors set forth 
therein to determine the scope of enablement. However, Applicants respectfully submit that the 
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PTO's conclusions are inconsistent with the teachings of Wands, as they rest on the erroneous 
assumption that a necessary connection between gene amplification and protein expression is 
required. The PTO states that "[w]ith regard to enablement, fundamentally the same arguments 
[as those given for utility] apply, and this rejection is maintained for the same reasons as given 
above in response to the arguments on utility." Office Action at 15. 

The Applicants believe that the evidence, declarations, references, and arguments 
discussed above make clear that Applicants have established that it is more likely than not that 
one of skill in the art would be convinced, to a reasonable probability, that the PR0539 protein is 
overexpressed in certain cancers, and therefore antibodies to PR0539 have utility as a diagnostic 
tool. This use is disclosed in the application, for example at page 95, lines 9-21 of the substitute 
specification, and the techniques for the creation of antibodies are well known and routine in the 
art. Thus, at least one use of antibodies to the PR0539 polypeptide is adequately enabled, which 
is all that is required - "if any use is enabled when multiple uses are disclosed, the application is 
enabling for the claimed invention." M.P.E.P. 2164.01(c). In view of the above, Applicants 
respectfully request that the Examiner reconsider and withdraw the enablement rejection under 
35 U.S.C. § 1 12, first paragraph. 
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CONCLUSION 



In view of the above, Applicants respectfully maintain that claims are patentable and 
request that they be passed to issue. Applicants invite the Examiner to call the undersigned if any 
remaining issues may be resolved by telephone. 

Please charge any additional fees, including any fees for additional extension of time, or 
credit overpayment to Deposit Account No. 11-1410. 



Respectfully submitted, 



KNOBBE, MARTENS, OLSON & BEAR, LLP 




Attorney of Record 
Customer No. 30,313 
(619) 235-8550 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

To^en F.lEfrntoftt:^ Thomas ThykjaerU, Frederic ML Waldman||, Hans Wolf**, 
and Julio E, Collsff 

partly because of technical Bmttattons. Sere we have™ - T 9 .T ex P resslon m 

tempted to address this question In pairs of non-Invasive t * individual genes In solid tumors 

and invasive human bladder tumors using a combination revealed a good correlation between gene dose and 

of technology that included comparative genomic hybrid- mRNA or P rote,n tevete ln t" 6 «se of c-erb-B2. cyciln <Ji, 

teatlon, high density oligonucleotide array-based monitor- ems1 • 80(3 N " m y c (3-5). However, a high cyclin D1 protein 

Ing of transcript levels (5600 genes), and high resolution expression has been observed without simultaneous am- 

two-dimensional gel electrophoresis/the results showed ^-P'Wcatlon (4), and a low level of c-myc copy number in- 

that there is a gene dosage effect W in some cases crease was observed without concomitant c-myc protein 

superimposes on other regulatory mechanisms. This ef- overexpression (6). 

lT^«!r" ded ? 1 1*??™ ma 9 nltudo <* «•» com- In human bladder tumors, karyotyping, fluorescent *i situ 

showed either reduced or unaltered transcript leveled Be- 0336 non " lnvaslve pTa transitional cell carcinomas (TCCs), 
cause most proteins resolved by two-dimensional gels Ws ,ndudes loss <* chromosome 9 or parts of ft, as well as 
are unknown It was only possible to compare mRNA and loss ^ Y ,n males « ln minimally Invasive pTI TCCs, the fol- 
Protein alterations In relatively few cases of well focused ,owin 9 alterations have been reported: 2q-, 11p-, lq+ ( 
abundant proteins, With few exceptions we found a good 1 1q13+. 17q+, and 20q+ (7-12). It has been suggested that 
correlation (p < 0.005) between transcript alterations and these regions harbor tumor suppressor genes and onco- 
protein levels. The implications, as well as limitations, genes; however, the large chromosomal areas Involved often 
P^Z^"^ JE*** M ° /eCU/ar * Ce//W/ar contain many genes, making meaningful predictions of the 
m/CS 1:37 ^ 20011 functional consequences of losses and gains very difficult. 

In this investigation we have combined gehome-wide tech- 

Aneuploldy is a common feature of most human no,ogy for d0tectlfl 9 fletiomlc gains and losses (CGH) with 

teomlcs) to determine the effect of gene copy number on 
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IUCSF Cancer Center and Department of Laboratory Medicine, UnJ- Mater/a/- Bladder tumor biopsies were sampled after Informed 

versJty of California, San Francisco, CA 94143-0808, and institute consent obtained and after removal of tissue for routine paihoJ- 

of Medical Biochemistry and Danish Centre for Human Genome Re- °W anamination. By ti£ht microscopy tumors 335 and 532 were 

search, Ole Worms Afle 170, Aarhus University, DK-8000 Aarhus C, 8ta 9 ed *V an experienced pathologist as pTa (superficial papiOary) 

Denmark , 
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« J^L 0 ^ numb€i " mRNA «xpres$lon level Shown from left to rfcftr am chromosome (Chr) GGH orofiles. am location 

^r^^!l^if ^ counteipart tumor 335. fl, expression of mRNA In invasive tumor 827 compared with the non-Invasive 
^ " l ! or6scent tumorDNAand normal DMA Is shown along thX^^ZSSe 

1}eft) The bokJ curve m the ratio profile represents a mean of four chromosomes and Is surrounded by thin curves ir^atinTcf^^H 

JJ^nnJl^ ^>mosonies where the non-Invasive tumor 335 used for comparison showed alterations tn DNA co^tZmtto 
S ^^IT t*%2 t0 T* * ^ ^ represents one^ e^ r ^!X^ 

ST^r^l^ name of the gene can be seen at www.MDLDK/sdata,htmQ. The bars Indlca^WnX^^ 

JT*** e 1 PreSSfon tevel ofthe * the invasive tumor compared with the fK>n-inva^^ 

ctoww^Sif^^r Wteale ^^teasthalf of the genes were up-regulated fbfec*), at least naif of the genes 

^^^r^^ a Z r0 *Z h- L < ? J?"" 8 "* unch ^<* If a gene was abs^rt In orToTme sampled pre^ 

dTeSr^ 83 2 " f0ld Chan9e * 2 - foW ^ was <*™*« ^ this corresponded to one standaroXia^ rn^oubS 

determination of -1800 genes. Centromeres and heterochromatic regions were excluded from data analysis 



Grade land II, respectively, tumors 733 and 827 were staged as pTI 
Orrvasrve Into submucosal 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Preparation-Tissue biopsies, obtained fresh from surgery, 
were embedded Immediately In a sodrum-guanldintum thiocyanate 
solution and stored at -80 «C. Total RNA was Isolated using the 
RNAzol b RN/\ Isolation method (WAK-Chemie Medical GMBH), 

P< 2X? J RNA ^ ***** 30 ^90(01) selection step (Oligotex 
mRNA kit; Qiagen). . 

cWA Preparation ~1 M g of mRNA was used as starting material. 
The first and second strand cONA synthesis was performed using the 
Superscript® choice system (Invltrogen) according to the manufac- 
turer's Instructions but using an oligo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrip® In vitro transcription kit (Ambion). BlotlrMabeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs In the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Qiagen). 

Array Hybridization and Scanning— Array hybridization and scan* 
nlng was modified from a previous method (13). 10 of cRNA was 
fragmented at 94 °C for 35 mln In buffer containing 40 mM Tris 
acetate, pH 8.1, 100 mM KOAc. 30 mM MgOAc. Prior to hybridization 
the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 m NaCI,' 
10 mM Tris, pH 7.6, 0.005% Triton), was heated to 95 °C for 5 mln' 
subsequently cooled to 40 °C, and loaded onto the Affymetrix probe 
array cartridge. The probe array was then Incubated for 16 h at 40 °C 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes In 6X SSP&T at 25 *C followed by 4 washes In Q.5X SSPE-T 
at 50 °C. The biotinyiated cRNA was, stained, wfth a streptavldin- 
phycoerythrin conjugate, 10 ftg/ml (Molecular Probes) in 6x SSPE-T 
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Fig. 1— continued 



for 30 min at 25 *C followed by 1 0 washes in 6 x S5PE-T at 25 °C. The 
probe arrays were scanned at 560 nm using a confocal laser scanning 
microscope (made for Affymetnx by Hewlett-Packard). The readings 
from the quantitative scanning were analyzed by Affymetnx gene 
expression analysis software. 

MicwsatetGte Ana^-MterosateOfte Analysis was performed as 
described previously (14). Microsateflilea were selected by use of 
vvwwJwbUIm.nih.gdv/genemap98, and primer sequences were ob- 
tahed from the genome data base at www.gdb.org. E>NA was extracted 
from tumor and blood and amplified by PCR In a volume of 20 *d for 35 
cycles. Trie ampfibons were denatured amle^eclroptwesedforahlnan 
ABI Prism 377. Data were collected In the Gene Scan program for 
fragment analysis. Loss of heterozyg^^ was defined as less than 33% 
clone allele detected in tumor empficons compared with Wood. 

Proteomic A/ra/ysfc-TCCs were minced Into small pieces and 
homogenized In a small glass homogenlzer in 0.5 ml of lysis solution. 
Samples were stored at -20 °C until use. The procedure for 2D gel 
electrophoresis has been described in detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or Coorhassle Bdniant Blue. Pro- 
teins were Identified by a combination of procedures that included 
microsequencing, mass spectrometry, two-dimensional gel Western 
ImmunoWotting, and comparison with the master two-dimensional gel 
image of human keratinocyte proteins; see Wobase,dkAxji-bin/cefls. 

CGW-Hybridization of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Fluorescein-labeied tumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 yjQ) were 
denatured at 37 °C for 5 mln and applied to denatured normal met- 
aphase slides. Hybridization was at 37 *C for 2 days. After washing, 
the slides were counterstalned with 0.15 fxg/mJ 4,6-diamidtno-2*phe- 
nyfindote In an anti-fade solution. A second hybridization was per- 
formed for all tumor samples using fluorescein-labeied reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled norma! DNA. Digital image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and losses. The average 
greerured fluorescence intensity ratio profiles were calculated using 
four Images of each chromosome (eight chromosomes total) with 
normalization of the greercred fluorescence Intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4.6-diamWino-2-phenynndole band- 
ing patterns. Only Images showing uniform high Intensity fluores- 
cence with minimal background staining were analyzed All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matlc regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization-The CGH analysis 
identified a number, of chromosomal gains and losses in the 
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Table I 

Correlation between aJterotioris detected by CQH and by expression monitoring 

Top OGH used as Independent variable (if CGH alteration - what expression ratio was found); bottom, altered expression used as 
^dependent variable ftf expression alteration - what CGH deviation was found). 



CGH alterations 
13 Gain 



Tumor 733 vs. $35 
Expression change clusters 



Concordance CGH alterations 



Tumor 827 vs. 532 
Expression change clusters 



Concordance 



10 Up-regulatfon 
0 Down-regulation 

3 No change 
10 Loss 1 Up-regulation 

5 Pown^regulation 

4 No change 



77% 



50% 



10 Gain. 8 Up-regulation 

0 town-regulation 
2 No change 

12 Loss 3 Up-regulation 

2 Down regulation 
7 No change 



Expression change clusters 



Tumor 733 vs. 335 
CGH alterations 



Concordance Expression change dusters 



Tumor 827 vs. 532 
CGH alterations 



80% 
17% 

Concordance 



16 Up-regulation 
21 Down-regulation 
15 No change 



11 Gain 

2 Loss 

3 No change 
1 Gain 

8 Loss 

12 No change 
3 Gain 

3 Loss 

9 No change 



38% 
60% 



17 Up-regulation 
9 town-regulation 
21 Nochange 



10 Gain 

5 Loss 

2 No change 

0 Gain 

3 Loss 

6 No change 

1 Gain 
3 Loss 

17 No change 



59% 
33% 
81% 



two invasive tumors {stage pT1 , TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33- f and and 7+, 9q-, 
and Y-, respectively. Both invasive tumors showed changes 
(1q22-24+, 2q14.1-qter-, 3q12-q13.3- 6q12-q22-, 
9q344 f 11q12-q13+, 17+, and 20q11,2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are shown In Fig. 1. Areas with gains and 
losses deviated from the norma) copy number to some extent, 
and the average numerical deviation from normal was 0,4-fold 
In the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 In TCC 733 (Fig, 1A) and 
20q12 In TCC 827 (Fig. IB). 

mRNA Expression In Relation to DNA Copy M/mber— The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-Invasive counterparts 
(TCCs 532 and 335). This was done in two separate experi- 
ments In which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1,800 genes that yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases tor 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their reported locus. In 
that way It was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the Invasive versus the non-invasive counterpart Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations In these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
increased mRNA expression* For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
Increased mRNA expression levels In the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts In both 
TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression In several cases, and were often regis- 
tered as having unaltered RNA levels (Table I, top). The inabil- 
ity to detect RNA expression changes In these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2-fold as the independent variable and es- 
timated the frequency of CGH alterations In these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (FCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected In areas with unaltered CGH ratios (Table I, bottom). 
Furthermore, as a control we looked at areas with no alter- 
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atton In expression. No alteration was detected, by CGH in 
most of these areas (TCC 733, 60% and TCC 827, 81 %; see 
Table \, bottom). Because the ability to observe reduced or 
Increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations in the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2K£pr both tumors TCC 733 (p < 0.015) and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the ONA copy number) and alteratfons detected by the array 
based technology (Fig. 2^ Similar data were obtained when 
areas with altered expression were used as Independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction In expres- 
sion level, which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent, 

MfcmsateUfte-based Detection of Minor Areas of Loss- 
es—In TCC 733, several chromosomal areas exhibiting ONA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1 , TCC 733 
chromosome 1q32, 2p21, and 7q21 and q32, 9q34, and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two mlcfosatellltes positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Fig. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/ho change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. As indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA dowrwegu^ 
tation may be regions that have undergone smaller losses of 
chromosomal material However, mis cannot be detected with 
the resolution of the CGH method, 

In both TCC 733 and TCC 827, the telomerlc end of chro- 
mosome 11p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two mlcrosatellites (D11S1760, 011S922) 
positioned close to MUC2, IGF2, and cathepsin D indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2, 12q21.1, and 16q24 
and In TCC 827 at chromosome 11p15.5, 12p11, 15q11.2, 
and 18q12 was also examined for chromosomal losses using 
microsatellltes positioned as close as possible to the gene loci 
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Fkj. 3. MicrosateHite analysis of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu class I histocornpatlUIUy antigen (gene 
number 38 In Fig. 1), (b) by D1S2735 dose to cathepsln E (gene 
number 41 Fig. 1), artd (c) at cruon^ceor^ close 
to general 0-spectrtn (gene number 11 on Fig. 1) and of (d) tumor 827 
showing toss of heterozygosity at chromosome 18q12 by S18S1118 
close to mitochondrial 3-oxoacyl-coenzyme A thkriase (gene number 
12 In Fig. 1). The upper curves show the electropherogram obtained 
from normal DNA from leukocytes (N), and the tower curves show the 
electropherogram from tumor DNA (7). In all cases one allele Is 
partially lost In the tumor amplicon. 

showing reduced mRNA transcripts. Only the mlcrosatellite 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes In the other regions 
may be controJIed by other mechanisms. 

Relation between Changes in mRNA and Protein Levels- 
2 DUPAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating In areas away from the edges of the pH 
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Fig. 4. Correlation between protein levels as Judged by 2D- 
PAGE and transcript ratio. For comparison proteins were divided In 
three groups, unaltered In level or up- or down-regulated ficrizonta! 
axfc). The mRNA ratio as determined by oligonucleotide arrays was 
plotted for each gene (vertical axfc). A, mRNAs that were scored as 
present In both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent In the Invasive tumors (along horizontal axis) or 
as absent In noninvasive reference (fop of figure). Two different 
scalings were used to exclude scaling as a confounder, TCCs 827 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 (#0) were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences in mRNA ratios 
between the groups. Proteins shown were as follows: Group A (from 
feff), phosphoglucomutase 1 , glutathione transferase class m number 
4, fatty acid-binding protein homologue. cytokeratin 15, and cyto- 
keratin 13; 6 (from fefl), fatty acid-blndlng protein homologue, 28-kDa 
heat shock protein, cytokeratin 13. and calcyclln; C<fromtefl), a-eno- 
lase, hnRNP B1, 28-kDa heat shock protein, 14-3-3- e, and 
pre-mRNA splicing factor; 0, irtesothetial keratin K7 (type II); £ (from 
top), glutathione S-transferase-ir and mesomeflal keratin K7 (type II); 
F (from top and feft), adenyfyl cyclase-associated protein, E-cadherin, 
keratin 19, calgizzarin, phosphogtycerate routase, annexln IV, cy- 
toskeletal -y^actln. hnRNP At, Integral membrane protein calnexin 
(JP90), hnRNP H f brain-type dathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear riboruicteoproteln A/B, 
translattonaJly controlled tumor protein, liver glyceraiderryoV3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na»K- 
ATPase 0-1 subunit, G, (from fop and toff}, TCP20, calgizzarin, 70- 
kOa heat shock protein, calnexin, hnRNP H, cytokeratin 15, ATP 
synthase, keratin 1 9, trtosephosphate Isomerase, hnRNP F, Bverglyc- 
eraldehyde-3-phosphatase dehydrogenase, glutathione S-transfer- 
ase-ir, and keratin 8; H (from left), plasma getedlin, autoantigen cal- 
reticufin, mloredoxin, and NAD+ -dependent 16 hydroxyprostaglandin 
dehydrogenase; / prom top), prolyl 4-hydroxylase 0-eubunit, cyto- 
keratin 20, cytokeratin 17, prohibition, and fructose 1,6-Wphos- 
phatase; J annexln II; K, annexln IV; £. (from top and fefl), 90-kDa heat 
shock protein, prolyl hydroxylase /5-eubunh\ a-enolaae, GRP 78, 
cydophllin, and cofifln. 

gradient, and having a known chromosomal location, were 
selected for analysis In the TCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0,005) between mRNA and protein alterations (Rg. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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532 



Fig. 5. Comparison of protein and transcript levels In invasive 
and non-Invasive TCCs. The upper pert of the figure shows a 2D ge! 
(teft) and the oligonucleotide array (r&ftr) of TCC 532. The red rectan- 
gles on the upper gel highlight the areas that are compared below. 
Identical areas of 2D gels of TCCs 532 and 827 are shown below. 
Clearly, cytokeratins 13 and 15 are strongly down-regulated In TCC 
827 (red annotation). The tile on the array containing probes for 
cytokeratin 15 is enlarged below the array (red arrow) from TCC 532 
and la compared with TCC 827. The upper row of squares In each tile 
corresponds to perfect match probes; the lower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
speclfic binding). Absence of signal Is depicted aa black, and the 
higher the signal the lighter the color. A high transcript level was 
detected In TCC 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cytokeratin 13, a high 
Transcript level was also present In TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom of the figure QefQ show levels of PA-FABP and adipocyte- 
FABP In TCCs 335 and 733 (invasive), respectively. Both proteins are 
dowrwegutated In the invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript A medium transcript level was de- 
tected In the case of TCC 335 (1277 units) whereas very low levels 
were detected In TCC 733 (166 units). /0% isoelectric focusing. 



keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected in TCCs 733 and 335, and of these 1 9 
correlated fp< 0.005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed In the non-invasive TCC 335 but lost In the Invasive 
counterpart (TCC 733; see Rg. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that-were available.- 

11 chromosomal regions where CGH showed aberrations 
that corresponded to the changes In transcript levels also 
showed corresponding changes in the protein level (Table II). 
These regions Included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acid-binding proteins PA-FABP and FBPl! Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area In Invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene expression In two pairs of non-Invasive 
and invasive TCCs using high throughput expression arrays 
and proteomlcs, In combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which In some cases was 
superimposed by a DNA copy number effect In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the feet that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 



Table II 

Proteins whose expression level correlates with both mRNA and gene dose changes 



Protein 



Chromosomal location Tumor TCC CGH alteration Transcript alteration- Protein alteration 



Annexin II 
AnnexinlV 
Cytokeratin 17 
Cytokeratin 20 
(PA-JFABP 
FBP1 

Plasma gelsolin 
Heat shock protein 28 
Prohibltin 
ProlyI-4-hydroxyl 
hnRNPBI 



1q21 


733 


Gain 


Abs to Pres - 


Increase 


2p13 


733 


Gain 


3.9-Fold up 


Increase 


17q12-q21 


827 


Gain 


3.8-Fold up 


Increase 


17q21.1 


827 


Gain 


5.6-Fold up 


Increase 


8q21.2 


827 


Loss 


10-Fold down 


Decrease 


9q22 


827 


Gain 


2.3-Fold up 


Increase 


9q31 


827 


Gain 


Abs to Pros 


Increase 


15q12-q13 


827 


Loss 


2,5-Fold up 


Decrease 


17q21 


827/733 


Gain 


3.7-/2.5-Fold up 6 


Increase 


17q25 


827/733 


Gain 


6.7V1. 6-Fold up 


Increase 


7p15 


827 


Loss 


2.5-Fold down 


. Decrease 



• Abs, absent; Pres. present. 

6 In cases where the corresponding alterations were found in both TCCs 827 and 733 these are shown as 827/733. 
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ever, an increase or decrease in DNA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respectively. Some of these transcripts could not be 

. detected in the non-Invasive tumor but were present at rela- 
tively high levels In areas with DNA amplifications In the Inva- 
sive tumors (e.g. In TCC 733 transcript from cellular llgand of 
annexin II gene (chromosome 1q21) from absent to 2670 
arbitrary Units; in TCC 827 transcript from small proline-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area Indicates an increased 
likelihood of gain of chromosomal material In this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, It seems striking that the gene 
dose effects were sq clearly detectable In gained areas. One 
hypothetical explanation may lie in the loss of controlled 
methylation in tumor cells (17-19). Thus, it may be possible . 
that In chromosomes with Increased DNA copy numbers two 

. or more alleles could be demethylated simultaneously leading 
to a higher transcription ievei, whereas In chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has documented a 
ploldy regulation of gene expression in yeast, but In this case all 
the genes were present In the same ratio (22), a situation that is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors* In pTa tumors, these include 9p- f 9q~, 1 q +, Y- 
(2, 6), and ln,pT1 tumors, 2q-,11p- f 11q~, 1q+, 5p+, 8q+, 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33- and 
9q- and respectively. Likewise, the two minimal Invasive 
pT1 tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 1 1q14-q22 loss, the latter 
often linked to 17 q+ (both tumors), and 1q+ and 9p- t often 
linked to 20q+ and 11 q13+ (both tumors) (7-9). These ob- 
servations Indicate that the pairs of tumors used In this study 
exhibit chromosomal changes observed In many tumors, and 
therefore the findings could be of general Importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it Is only possible to get a crude picture of 
chromosomal Instability using this technique. Occasionally, 
we observed reduced transcript levels dose to or inside re- 
gions with increased copy numbers. Analysis of these regions 
by positioning heterozygous microsateiiites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity in several cases. It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use ofcDNA mlcroarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploldy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an Impact on the expression level in norma) ceils and is often 
reduced Jnjumore. However, toe jelatiqn between Imprinting 
and gain of chromosomal material is not known. 

We regard It as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close, and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available ft was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression Is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

. In the few cases analyzed, mRNA, and protein levels 
showed a striking correspondence although In some cases 
we found discrepancies that may be attributed to translational 
regulation, posMranslational processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
translational ly inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very Important in the case of polypeptides with a short 
half-life (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells as deter- 
mined by arrays and 2D-PAGE (25), and a moderate correla- 
tion was recently repotted by Ideker et a/. (26) In yeast 
Qnterestingly, our study revealed a much better correlation 
between gained chromosomal areas and Increased mRNA 
levels than between loss of chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript) One possible 
explanation could be that by losing one allele the change In 
mRNA level Is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severaffold Increase In gene copy number resulting in a much 
. higher Impact on transcript level, the latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection of lesser than 2-foid 
alterations In transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression; and protein level Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area In chromosome 17q. Whether DNA 
copy number Is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated Is the large 
extent of protein modification that occurs after translation, 
requiring immunoidenttflcatJon and/or mass spectrometry to 
correctly Identify the protelnsTn the gels; 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on microarrays with many thousand radiation 
hybrid-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments 029. Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 2D gel analysis to obtain information at 
the post-transiational level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 
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ABSTRACT 

Genetic changes underiie tumor progression and may lead to cancer- 
specific expression of critical genes. Over HOQ publications haw de-. 
; scribed the use of comparative genomic hybridization (CGH) to analyze 
the pattern of copy number alterations In cancer, hut very few of the genes 
affected are known. Here, we performed hlgh^resolution CGH analysis on 
cDNA microarrays In breast cancer and directly compared copy number 
and mRNA expression levels of 13,824 genes to quantity the Impact of 
genomic changes on gene expression. We Identified and mapped the 
boundaries of 24 independent ampllcons, ranging in size from 0.2 to 12 
Mb, Throughout the genome, both high- and WleVel copy number 
Stages had a substantial impact on gene expression, with 44% of the 
highly amplified genes showing overexpresslon and 10.5% of the highly 
overexpressed genes being amplified. Statistical analysis with random 
permutation tests identified 270 genes whose expression levels across 14 
samples were systematically attributable to gene, amplification. These 
included most previously described amplified genes in breast cancer and 
many novel targets for genomic alterations, Including the BOXB7 gene, 
the presence of which In a novel amplicon at 17q21.3 was validated in 
10.2% of primary breast Cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on d>NA microarrays revealed hundreds of 
novel genes whose overexpresslon is attributable to gene amplification. 
These genes may provide Insights to tije clonal evolution and progression 
*f breast cancer and highlight promising therapeutic targets. 

INTRODUCTION ' 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biolbgicalry distinct catego- 
ries; some of which may explain the clinical behavior of the tumors 
(1-^6). Despite tiiis progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
nmtae^ehisro. and the utility of . gene expression profiling in the 
identification of specific therapeutic taxgets remains JtoHe^r^ ^ 

Accumulation of genetic defects is thought to underlie me clonal < 
evolution of cancer. Identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts mat 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets , for anticancer therapies, as - 
demonstrated by the clinical success of new merapies against ampli- 
fied oncogenes, such as JSRBB2 mAEGFR'Q, 8), in breast cancer and 20 recui T eilt of DNA amplification have been mapped in 
other solid tumors. Besides amplifications of known oncogenes oyer b ' rea?t wmcer by 00115 ( 9 > 10 )- However, these amplicons are often 

\ : • 5 . ^^P 00 ^ defined,^ 

-— • unknown. 




. Fig JL Impact ofgehooopy iniinbec"on"b^ t^^v^m icveis. a. percentage of 

I~5?,7 ^^f^- for ^ ^toexpresjfon were >2J84 (global upper 7% of 
fee cDNA rate) a»d <0.4826 (global lower 7% of the expression ratios). Seated 
of arnphficd and deleted genes according to expression ratios. Threshold values for 
imphfication and deletion were >U and <0.7~ • 
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We hypothesized that genome-wide identification of those gene 
expression changes that , are attributable to underlying gene copy 
number alterations would highlight transcripts that are actively in- 
volved in the causation or maintenance of the malignant phenotype. 
To identify such transcripts, we applied a combination of cDNA and 
CGH inicroarrays to: (o) determine the gjobal impact that gene copy 
number variation plays in breast cancer development and progression; 
and (b) identify and characterize those genes, whose mRNAexpres- 
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5 The abbreviations used are: COH, comparative genomic rrybtidization: FISH, fluo- 
rescence in situ hybridization; RT-PCR, reverse transcription-PCR. 



OENB EXPRESSION PATTERNS IN BREAST CANCER 
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aibn is most signfficantty associated with amplification of the corre- 
sponding genomic template. 

MATERIALS ACED METHODS 



Breast. Cancer Offl line*. Fourteen breast cancer cell lines (BT-20 BT- 
47^HOC1428, Hs57St, MCF7, MDA-361, MDA.43.6, MDA-453, MDA-468, 
T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cDNA Mlcroarrays. The 
pr^aratfon and printing of the 13,824 cDNA clones <m glass slides were 

perfor med as described (1 1-13). Of these clones, 1244 represented uncharac- ™ 1 *'"" v * uuu wa? wimana wtm gene expression using the signs 
tensed expressed sequence tags, and the remainder corresponded to known ^*^<* 0). We calculated a weight, w&9 for each gene asJbUpws: 
genes. CGH experiments on cDNA mjcroarrays were done as described (14, 



were excluded torn the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define outpoints for increased/ 
decreased copy number. Genes with CGH ratio >M3 (representing die upper 
S% of the CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <p;73 (representing me lower 5%) were considered to be 
deleted 

Statistical Analysis of CGH and cDNA Mlcroarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, > 1.43) and 0 fa 
Arnplification was correlated wim gene expression using the signaMo-noise 



15). Briefly, 20 /*g of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14^18 h wim Alul and Jtsal (Life Technol- 
ogies, In*, Rcckville, MD) and purified by phenoVcWcrxrfbrm extraction. Six 

ZmJS^ i?n5r A8 ^<™ < * MbW (AmeKham ^ *** *«* °» dcnote ** mean * *» d SDs ** ** expression 

P^da) and noma I DNA with CyS-dUTP (Amersham Pharmacia) usihg levels for amplified and nonamplified cell lines, respectively. To a^the 

2SSE^ ,5)aad of elTweight; we p^^^m tll^^ 

'SS^ J*^ 03) n^ *r° described. For the expression tations of the label vector. The probability tot a gene had a largToS 

anaj^ a standard reterertce (Universal Human Reference RNA; Strataiene, weight by random permutation than the original w^nt was ISS w 

ratete^^^ of reference RNA were, low a «0.05) indicates a strong assock^ 

W-*™ and 3.5 jxg of test mRNA wim CySHnjTP, and the ampUfication. ■ ■ g . expression ana 

. WCtt Raized on mic^ys as described (13, f 5). For both Genomic localization of cpN A Clones and Amplicon Mapping Each 

HSE • ^ °? ~ ^^ Technologies, Palo cDNA clone on the microarray was assigned to a U^SSSJZ 

^o^lZZl^^ £ at UnlgCnC Bm1d ,4, ' <A database of eenomic sequence dignmentTnfo^n 

"*T^ l itJ^ ^*^ d f0r WCtS ^ created fiw the August Wta of the u£- 

l^l^Tnft^t^Z^ T h £^* m * * e versity of CaHfomia Santa Cruz's GoldenPath database? Tbe chromosome and 

STcni * ^ ^ WteXte*. For bp positions for each cPNA clone were men retrieved by relating theseTta 
SLSLT?^ F»' rah0S WfC nonndi ^^ thc **** of the . sets, Amplicons were defined as a CGH copy number ratio >10ttiat least two 

£5^2^ 0{ f^*™**™y ^ for the expression analysis adjacent clones m two or more ceU lines or a CGH ratio >2.0 in at leTS 

!1TtT , v housekee P m e Sen**, were spotted four times onto the adjacent clones in a single cell line. The amplicon start and end positions were 

array/Low quahty measurements (i.e., copy number data with mean reference . ^ ana ena posiuons were 

intensity <I00 fluorescent units, and expression data with both test' and 6w '" ^_ t 

reference intensity < 1 00 fliwwopDt imUs aniMr. widi i^bt sjbo <50 imite) 'EE^ 
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Table I Summary of independent ampUcons in J4 breast cancer cell lines by 
' CGH microarray 



Location 



Start (Mb) 



Ipl3 

lq22 
3p!4 

7pl2.1-7pll.2 

7q31 

7q32 

8q2!.lMq2U3 
8q213 

8a233-gq24.14 

8a24.22 

9pi3 

i3g22-o31 . 
17qll 

I7ql2-a21.2 

17o21.32-q2l.33 

17q22-o23.3 

19ql3 
20qll.22 
20ql3.12 . 
20ql3.12-ql3.13 
2(^13^1332 



End (Mb) 



Size (Mb) 



13179 
171*2 
179.28 
71.94 
5162 
125.73 
140.01 
86.45 
98.45 
129.88 
15121 
38.65 
77.15 
86.70 
29.30 
39.79 
52.47 
63.81 
69.93 
40.63 
.34.59 
44,00 
46.45 
5132 



132.94 
1773 
17937 
74.66. 
60.95 
130.96 
140.68 
92.46 
103.05 
142.15 
15116 
.3915 
8138 
87.62 
3035 
42.80 
55.80 
69.70 
74.99 
41.40 
3535 
45.62 
49.43 
59J2 



.0.2 

33 
03 
2.7 
53 
51 
0.7 
6.0 
4.6 
123 
1.0 
0.6 
41 
iX9 
1.6 
3.0 
33 
5.9 
5.1 
0.8 
.13 
U 
3.0 
. 7.8 



CGH were validated, with lq21, 17ql2-q21.2j 17q22-q23, 20ql3 1 
and 20ql3.2 regions being most commonly amplified. Furthemoi* 
toe boundaries of these amplicons were precisely delineatecL In ad* 
dition, novel amplicons^ were identified at 9pl3 (38.65-39.25 MM 
and 17q2U (52,47-55,80 Mb). °* 
Direct Identification of Putative Amplification target Genes* 
The cDNA/CGH microarray technique enables the direct oorreli*. 
tion of copy number knii expression data on a gerie-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified gene? in the MCF-7 breast cancer 
cell line at lpl3, 17q22-q23, and 20ql3 were highly overex- 
pressedr -A view of chromosome 7 ill the I4DA^468 cell lirie 
implicates EGFR as the most highly overexpressed and amplified 
gene at 7pl l-pl2 (Fig.. 34). In BT-474, the two known amplicons 
at 17ql2,and 17q22-q23 contained nuinerbus highly overex- 
pressed genes (Fig. 31?). In addition, several genes, including the 
homeobox genes HOXB2 and HOXB7, were highly amplified in a 
. previously undescribed dependent HOXB7 
was systematically amplified (as validated by FISH, Fig. 3B, inset) 
as well as overexpressed (as verified:by RT-PCR, data not shown) 
in BT-474, UACC812, and ZR-7S-30 cells. Furthermore, this ri<ive1 



extended to include neighboring mmamplified clones (ratio, <IS). The am- 
. blicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines Was done as 
described (17). Bacterial artificial chromosome clone RP11-361K8 was la- 
beled with SpectrumOrange (Vysis, Downers Grove, IL), and Spectrum- 
Orange-labeled probe for EGfFR was obtained from Vysis. SpectrumGreen- 
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue inicroarray containing 612 formalin-fixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
' ( 18 >* 1110 "s* of these specimens was approved by the Ethics Committee of the 
University of Basel and by the NH. Specimens containing a 2-told or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere, signals, in at least 10% of me tumor cells were considered to be 
amplified.. Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test 

RT-PCR. The HOXB7 expression level was determined relative to 
GAPDH. Reverse transcription and PGR amplification were performed using 
Access RT-PCR System (Promega Corp., Madison, WI) with 10 ng ofmRNA 
as a template. HOXB7 primers were 5'-GAGCAOAGGGACTCGGACTT-3 ' 
and 5'-GCGTCA(3GTAGCGATTGTA<W. 

* *• • ■ - • - .■ . v • , 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH miexoarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5> were overexpressed (te, belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. 1A). Conversely, 10.5% 
of the transcripts, with high-level expression (cDNA ratio, >10) 
showed, increased copy number (Fig. IB). L6w-level copy number 
increases and decreases were also associated, with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer AmpUcons. Base-pair 
locations obtained for 1 1,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb* This high-resolution mapping identified 24 independent 
breast cancer ampUcons, sparining from 0.2 to-12 Mb of DNA (Table 
1). Several amplification sites detected previously by chromosomal 





^1 Annotation of gena expression data on CGH microarray profiles. 4 genes In the 
JP 1 Lt 1 r l am P hcon b*x MDA-468 ceU line are highly expressed (red dots) and include 
the EGFR oncogene. £ several genes in the 17ql2, 17q2U, and 17q23 tmplicons In the 
BT-474 breast cancer cell line are highly overexpressed (red) and Include the HOXB7 
gene. The data labels and color coding are as Indicated for Fig. 2C Insets show 
chromosomal CGH profiles for the corresponding chromosomes and validation of the 
Increased copy number by interphase FISH using BGFR (red) and chromosome 7 
centromere probe (green) to MDA-468. (4) and HOXB7«p«Aflc' probe {red) and chro- 
mosome 17 centromere {green) to BTM74 cells (JB). 9 
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S?iK "SS^T L0W " leVCl T** ^ 811(1 l0SSeS betwcen 7 ^P^ation and poor patient prognosis. Overall 
t^JSfSL T °* eXpTeSS1Qn leVClS ° fgeneS 111 *» <* how the identlficatian of geiS Sdt 

X ^ 5? ^ "I 01 " ° D a 8^ amplification provides a powerful approach teKSS 

fesis than those ofb^level amplifications. However, the impact of genes with an imporint role in cancer as win as to 
low-level ga*sonthe dysregulation of gene expression patterns in validate putative targets for therapy developmSt * 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneoploidy and low-level gains and losses 
of chromosomal arms represent flie most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24); . - 

The CGH mioparray analysis identified 24 independent breast 
cancer ainplipons. We defined the precise boundaries for, many am- 
pifcons detected previously by chromosomal CGH (9, 10, 25, 26) arid 
also discovered novel amplicous that had not been detected previ- 
ously, presumably because of flieir small size (only 1-2 Mb) or close 
proximity to other larger atnplicons. One of these novel amplicons 
involved the homeobox gene region at 17q21 3 and led to the over- 
expression of file HOXB7 and HOXB2 genes. Hie homeodoniain 
frBbscrfction fectors Wlcnown to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 trarisfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
taraorigeaicity and angiogenesis in breast cancer (29-32). the pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and Confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. . 

We carried out a systematic search to identify genes whose 
egression levels across all 14 cell lines were attributable to 
amplification status.. Statistical analysis revealed 270 such genes 
(representing -2% of all genes pn the array), including not only 
previously , described amplified genes, such as HEX-2, MYC t 
EGFR, ribosomal protein s6 kinase, and AIB3, but also numerous 
novel genes such as NRAS~related gene (lpl3), syndecan-2 («q22), 
and bone morphogeny protein (20ql3.1), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of tiie 270 gencS have not been iinpficated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that B4% of .the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we. demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (b) a high-resolution map. of 24 independent 
arnplicons in breast cancer; and (c) identification of a set of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification: Characterization of a novel amplicon at 
17q2L3 implicated amplification and overexpression of the 
HOXB7 gene in breast cancer, including a clinical association 
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Mkroarray analysis reveals a major direct role of 
DNA copy number alteration in the transcriptional 
program of human breast tumors 

Compr^h^e Cancer Center, Un^^^^^^^^^Be^. ^ay; and 'DepanmertM Genet ta a^Uneb^ 
Contributed by Patrick O.firewa Ausurt 6, 2002 



Genomic DNA copy number alterations are key genetic events in 
Redevelopment and progression of human cancers. He^e we 
bSSl , 9enom *^ l,te mlcroarray comparative genomic hybrid- 
SeisS' "l aMl ^ S „ 0f 0NA C<W ««•** variation ta 
bo^mhT^ i Uman bre3St tMm *^ We «ave profiled DNA 
Smi^l ^^r 0 " 6,691 ma »Ped human genes. In 44 
predominantly advanced, primary breast tumors and 10 breast 

^ yl ^° / n corroborate previous cytogenetic studies, the hlgh- 

«^ i^r 81 mapp,n3 of boundaries 2nd 

d*e quantitative analysis of amplicon shape provide sionificant 
hnprovement In the localization of candidate on^neJpaS 
SZZ'Z?r7T 1 mRNA ,eve,s 'eveal^he remark 
Slrinn ^i Varia,I °" 1n , Sen6 W P V number contributes to 
th£ tZL IZ^^J" tUmor ce,ls - *»ettkally. we find 
ri~J^ ^ ampnfled genes *ow moderately or highly 
express, °^. * at °NA copy number Influences gene Tex* 

fd^leZ, l^!" IT^fT 0 ' 0NA *°W " umber alterations 
i * ' J**"* ^9^^' amplification), that on average. 

iST^n * *!T: 9e *? mRNA *«"» that overall, at least 
«% of an the variation in gene expression among the breast 

monber. These finding, provide evidence that widespread DNA 
Sogre^on^f can^r < ° ntribute to the development or 

^onvcntiona^cytogenetic techniques, induding comparative 
eSSSE* l] y b * ,iz ? tion (OOH) (1). have ledto the Cfi- 

.2^*JT be,r 0f recuirent ot DN A copy number 

alteration in breast cancer cell lines and tumors (2-4). White 
f° me °f £ese regions contain known or cand^te raro«n« 

i^i ^i W ^J9^ *** to™" «Wpr«iSor genes 

Ett) 4 " 11 T ? 3 ( A 7pl3) J' relevanlgW) within 
other regions (eg., gain of lq. 8q22, and 17q22-24, and loss of 
^ remain to be identified 1 rhigh-rt^luticm^mSde 
map, delineating the boundaries of DNA copy number alter- 
cS VT K ' Sh0uW *° barton and identifi- 

er f °SS 0gen 1 and tU01 <» suppressor genes in breast 
cancer In this study, we have created such a man ustao 

in™£n^ 10 "°ft DNA copy™ m^eratSn 
in a series of breast cancer cell lines and primary tumors. 
An unresolved question is the extent to which the widespread 

S^SP. nUmber f 1 *" 8 * ^ 81,(1 others have Msntificd 
2^ a'tfr expression of genes within involved 
region*. Because we had measured mRNA levels in parallel in 
the same samples (8), using the same DNA miooarrak we had 
<0 scale the r£ns% 

between DNA copy number changes and gene expression; From 

www.poaj^g/cgi/doI/10. 1073/pnai.16247i999 



™ JS ^1 h3VC ld 5 nt,fied 8 ^Snificant impact of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. p " onai 

Materials and Methods 

f"? • Pri ?. ary breast tumots we " predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcW 

D^Sn/ w 0 ^'. 18 ^ ^ens averaged at least 5o£> 
Details of individual tumors have been published (8 9) and 
are summarized in Table 1, which is published as supoSrW 
mformauon on the PNAS web she, wwvv4mas.0rg. BnSS 
Sll^ZT* ob ^ed from the American TyJe (Sh^e 
Collectiori Genomic DNA was isolated either usfag Qiaeen 
genomic DNA columns, or by phenol/chloroform e^clon 
followed by ethanol precipitation. wuwwn 

S^JUf? ™. itroarra y Hybridizations. Genomic DNA label- 
tog and hybridizations were performed essentially as descrih«t 

5 SSt*^ iiM* ^cationtlwo micreSS 
W f, abeled m 8 to,al TOhu »e of 50 micreliters aldtoe 
TOtumes of all reagents were adjusted accordingly. "Test" DNA 

LT^ 8 "' 1 ^ f h«orescentiy laWed ^y5) and 

hofenchzed to a human cDNA microarray containing 6 691 
^"rt h , u,nan genes (i.e, UniGene dusted). The 
reference" (labeled with Cy3) for each hybridization was nor- 
™ S e DNA^from a single donor. The fabrication 

»n1?A A ni,cr T ray l and tfie labeling and hybridization of 
mRNA samples have been described (8). 

A" 3 ^ 8nd Map f?*^ Hybridized arrays were scanned 
on a uenePn: scanner (Axon Instruments, Foster Qtv CA\ and 
fluwescence ratios (test/reference) cWatXu^cWS 
^^< a «at http://ninaJM.gov). FtaKS IS 
were normalized for each array by setting the average lo E 
fluorescence ratio for all array elements equal to 0. Sure? 
ments with fluorescence intensities more than 20% above back- 

, S 8b,e - DNA eopy nutnberprofil* 
that derated signfficantly from background ratios measured to 

evidence of real DNA copy number alteration (see Estimating 
Si^vfica^ of Abend Fluorescence Ratios In the sumS 
informanoa). When indicated, DNA copy number prtmWaref 

m?' 8 K " 8Vcra « e < symmetries-nearest ne^bor^ 
Map positions, for arrayed human cDNAs were assigned by 

AbbreviaUon: CGH. comparative genomic hybridization. 
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represented in the corresponding UnKJene 
cluster X10) against the "Golden Path" genome a-semblv 
(http://genome.ucsc.edu/; Oct 7, 2000 Freele). For S 

«^n« «uos (for all elements representing the same UniGe£ 
duster) are reported. For mRNA measurements, f hwresoMce 
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ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information: 
Results 

Vfeperformed CGH on 44 predominantly locally advanced, 
pnmary breast tumors and 10 breast cancer cell fines, using 
tf>NA mKroamys containing 6,691 different mappeThuS 
*j ?™ ^ Materials and Methods foTdetalls of 
microarray hybridizations). To take foU advantage of the Im- 
proved spatial resolution of army CGH, we otS I (fU,orS- 

w aTIF the 6,691 cDNAs accordin « t0 * e S SEL' 

^ ://genome.ucscedu/) genome assembly of the draft 
human genome sequences (11). [ n so doing, arrayed cDNAs not 
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dektlon. Parallel analysis of DNA from cell lines containing 
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£L I^TS? 1 v^ 6 ^toty of our method to detect single- 

SiuftiStS^" 1 ? V. <47 ' XXX) - * (^woajTor 

"•^"^f^W) gains (also see Rg. 5, which is published 
SEJS^S "formation on the PNAS web site). Fluorescence 
^ Proportional to copy number ratios, which 
were slightly underestimated, in agreement with prevtousT 

£^Vn er0US DNA c °Py n "moer alterations were 
evident in both the breast cancer cell lines and primary tumors 
ffi e tumors despite the presence ofeuploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found m every cancer cell line and tumor, and 
ehromosome in at least one sample. Recurrent 
regions of DNA copy number gain and loss were readily iden- 
tifiable. For eample, gains within lq, 8q, 17q, and 20q were 

£5S£« l&ET^**"" ^rcUll lines/tumors 
(90%/69%, 100%/47%, 100%/60%, and 90%/44%, respective- 
ly), as were losses within lp, 3p, 8p, and 13o f80%/24% 
80%/22%, 80%/22%, and 7Q%/U%^A { ^? n l 
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number of genomic alterations (gains and losses) was found to 
JJ^gnfficantty higher in breast tumors that were high grade (P = 
0.008), consistent with published CGH data (3), estrogen recep- 
t fL£% a * rft i p ~ 0X>4> and harboring TP53 mutations (P - 
0.0006) (see Table 4, which is published as supporting informa- 
tion on the PNAS web site). B 

The improved spatial resolution of our array CGH analysis is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a). The complexity of ampHcon 
f"f" re » most easily appreciated in the breast cancer cell line 
2>KBR3; Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 7by For each of these regions we can define the 

Pollack rt at. 



bounties of the interval recurrently amplified in the tumors we 
examined; in each ease, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurremry amplified regions on chromosomes 17 and 20, can be 
found io Figs. 6 and 7, which are published as supporting 
information on the PNAS web site). "wwwug 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel by using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells 

A strong influence of DNA copy number on gene expression 
is evident m an examination of the pseudocolor representations 
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^ y ° umbcr *** mRNA !eveb f or genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression are quite concordant; Le^ a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
gauss), 67% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). 

To determine the extent to which DNA deletion and lower- 
level amplification (m addition to high-level amplification) are 
also .associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. 4a). For both the 
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breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pair-wise Student's 
f tests comparing adjacent classes: cell lines, 4 x 1 x lO" 49 . 
5 X lOrM * 10-2; tumors, 1 X Mr* 1 X 10-»* 5 X 10-«\ 
1 X 10- 4 ), A linear regression of the average log(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
number was accompanied by 1,4- and 13-fold changes in mRNA 
level for the breast cancer cell lines and tumors, tespectively (Fig. 
J*> regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 4b). 
The distribution of correlations forms a normal-shaped curve 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and ref lects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig. 4b) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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tumors that could be attributed to underlying variation in DMA 
copy number. From this analysis, we estimate that, overall, about 
7% of all of the observed variation in mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. 4d). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. 4$. This still undoubtedly 
represents a significant underestimate, as the observed variation 
In global gene expression is affected not only by true variation in 
the expression programs of the tumor ceils themselves, but also 
by the variable presence of non-tumor cell types within clinical 



Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
Usefulness of defining amplicon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 
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cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be generalizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes), ^ 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13)* Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement with our findings, Phillips ct «/, (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective Increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et aL (15) recently reported that in metastatic 
colon tumors only -4% of genes within amplified regions were 
found more highly (>2-fold) expressed, when compared with 
normal colonic epithelium* This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression* These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Platzer et al (15) may have 
systematically under-measured gene expression changes. In this 
regard it Is remarkable that only 14 transcripts of many thousand 
residing within unampJified chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity amplicons, effectively overcalling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may. represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global ^e~expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific autoregulation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape [to identify the "driving*' gene(s) 
within amplicons (16)], on a large number of samples,in addition 
to functional studies. Fourth, ithis finding suggests that analyzing 
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the genomic distribution of expressed genes, even within existing 
micmarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
fmding implies that a substantial portion of the phenotypic 
uniqueness (and by extension, the heterogeneity in clinical 
behavior) among patients' tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this rinding supports 
a possible role for widespread DNA copy number alteration in 
iumorigenesis.(17, 18), beyond the amplification of specific 
oncogenes. and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread" MbaJance in gene^e^ 

stochioraetric relationships in cell metabolism and physiology 
(e.g., prpteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our finding? suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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Figure 6-3 Genes can be expressed 
with different efficiencies. Gene A is 
transcribed and translated much more 
efficiently than gene B.This allows the 
amount of protein A in the cell to be 
much greater than that of protein B. 



| TRANSLATION 

FROM DNA TO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from, the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 

Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence — a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA— 
the language of a nucleotide sequence. Hence the name transcription. 

like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from 
DNA chemically in twof respects: (1) the nucleotides in RNA are 
ribonucleotides— that is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deojcyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like T, can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). It is not uncommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6) . As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 
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Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 

302 Chapter 6 : HOW CELLS READ THE GENOME: FROM DNA TO PROTEIN 




Figure 6-39 Protein aggregates that cause human disease. (A) Schematic illustration of the type of 
conformational change in a protein that produces material for a cross-beta filament. (B) Diagram Illustrating 
the self-infectious nature of the protein aggregation that Is central to prion diseases. PrP is high)/ unusual 
because the misfolded version of the protein, called PrP*, Induces the normal PrP protein it contacts to 
change Its conformation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggregation, but because this structure is 
not infectious In this way, it cannot spread from one animal to another. (C) Drawing of a cross-beta filament, 
a common type of protease-resistant protein aggregate found in a variety of human neurological diseases. 
Because the hydrogen-bond interactions in a P sheet form between polypeptide backbone atoms (see Figure 
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several 
possible models for the conversion of PrP to PrP*, showing the likely change of two a-helices into four 
P-$Vands. Although the structure of the normal protein has been determined accurately, the structure of the 
infectious form is not yet known with certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serp ell, adapted from M. Sunde et al.J. MoL Biol 273:729-739, 
1 997; D, adapted from S.B. Prusiner, Trends Biochem. Sd. 2 1 :482-487, 1 996.) 

animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the "mad cow disease") from cattle to humans in Great Britain. 

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an infectious conformation, a similar transformation has been discovered 
to be the cause of an otherwise mysterious "protein-only inheritance" observed 
in yeast cells. 

There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed. 

We discuss in Chapter 7 that cells have the ability to change the levels of 
their proteins according to their needs. In principle, any or all of the steps in Fig- 
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Figure 6-90 The production of a 
protein by a eucaryotic cell. The final 
level of each protein In a eucaryotic cell 
depends upon the efficiency of each step 
depicted. 
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ure 6-90) could be regulated by the cell for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is the most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step— the transcription of its DNA sequence into an RNA 
molecule. 



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonucleoprotein assembly called a ribosome. The 
amino acids used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in the mRNA (codons). The sequence of nucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation, a small ribosomal subuntt binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs—each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by forming 
complementary base pairs with the tRNA anticodon. Each amino acid is added to the 
C-terminal end of the growing polypeptide by means of a cycle of three sequential 
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Figure 7-5 Six steps at which 
eucaryotic gene expression can be 
controlled. Controls that operate at 
steps i through 5 are discussed in, this 
chapter. Step 6, the regulation of protein 
activity, includes reversible activation or 
inactivation by protein phosphorylation 
(discussed in Chapter 3) as well as 
irreversible inactivation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (1) controlling when and how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mRNAs in the cell nucleus are exported to the cytosol 
and determining where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
byribosomes (translation^ control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
expression. 

Summary 

The genome of a cell contains in Us DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction of Us genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals 
from other cells. Although all of the steps involved in expressing a gene can in prin- 
ciple he regulated, for most genes the initiation of RNA transcription is the most 
. important point of control 



DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As 
mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
trolled by a regulatory region of DNA relatively near the site where transcription 
begins. Some regulatory regions are simple and act as switches that are thrown 
;oy a single signal. Many others are complex and act as tiny microprocessors, 
.responding to a variety of signals that they interpret and integrate to switch the 
neighboring gene on or off. Whether complex or simple, these switching devices 



-BINDING MOTIFS IN GENE REGULATORY PROTEINS 



379 



occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired However promoters of genes that remain active 
in the germ cell lineages (including most housekeeping genes) are kept 
unmethylated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired Such regions are preserved in modern day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 

Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many cell 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 
cell divides. These features endow the cell with a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other, this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Direct or indirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis for cellular clocks. 

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type ofcelL A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene . expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also functions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these pbsttranscriptional 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional control, for 
iaany genes they are crucial. 
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Figure 7-66 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and their clustering 
into CG islands in vertebrate 
genomes. A black line marks the location 
of a CG dinucleotide In the DNA 
sequence, while a red "lollipop" indicates 
the presence of a methyl group on the 
CG dinucleotide. CG sequences that lie in 
regulatory sequences of genes that are 
transcribed In germ cells are unmethylated 
and therefore tend to be retained in 
evolution. Methylated CG sequences, on 
the ojher hand, tend to be lost through 
deamination of 5-methyl C toT, unless the 
CG sequence is critical for survival. 



POSTTRANSCRIPTIONAL CONTROLS 



435 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: , 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



